Generalization from Observed to Unobserved Features
نویسنده
چکیده
The thesis discusses various aspects of learning from a subset of features to another feature subset in the framework of machine learning. We propose feature to feature learning and generalization as a possible model for clustering, as a method to improve feature selection and to incorporate prior knowledge in supervised learning. We also discuss the possible relation of our models to learning in biological neural systems. The most common machine learning model is learning from examples. In this model, the algorithm learns structures from a nite set of observations, called a training set, and uses the learning results to perform some task. In supervised learning each observation is a pair made up of an instance and a label, and the algorithm learns to map between instances to labels. We expect a good learning algorithm to generalize from the nite training set to new unseen instances, and to be able to accurately predict their labels. The instances are typically represented by vectors of measurements, where each measurement is called a feature. For example, in an optical handwritten recognition task, an instance can be an image of the character and the label the name of the character. The features can be the gray level values of each pixel, or functions of these pixels such as strokes. Another prominent learning model is clustering, where each observation contains an instance without a label. Clustering deals with grouping observations based on their similarity. In general, it is not clear how to measure the quality of clustering, since we do not have a well-de ned prediction task as in supervised learning. In some cases, the goal of clustering is to predict some label which is unknown to the clustering algorithm. However, it is unclear how we can optimize the quality of the clustering to an unknown label. Even worse, the clustering quality depends on the speci c choice of the labels. For example, good documents
منابع مشابه
The Contribution of Observed and Unobserved Fundamentals to Exchange Rate Movements in Iran
Using a State-space model, this paper investigates the contribution of both observed and unobserved fundamentals to nominal exchange rate movement in Iran for the period 1991:2-2011:4. To this end, we follow Engel and West (2005) and Balke et al. (2013) and use an asset-pricing approach to develop a rational expectations present value exchange rate model. In order to examine the role of fun...
متن کاملAn integrated account of generalization across objects and features.
Humans routinely make inductive generalizations about unobserved features of objects. Previous accounts of inductive reasoning often focus on inferences about a single object or feature: accounts of causal reasoning often focus on a single object with one or more unobserved features, and accounts of property induction often focus on a single feature that is unobserved for one or more objects. W...
متن کاملAn integrated account of generalization across objects and featuresI
Humans routinely make inductive generalizations about unobserved features of objects. Previous accounts of inductive reasoning often focus on inferences about a single object or feature: accounts of causal reasoning often focus on a single object with one or more unobserved features, and accounts of property induction often focus on a single feature that is unobserved for one or more objects. W...
متن کاملGeneralization in Clustering with Unobserved Features
We argue that when objects are characterized by many attributes, clustering them on the basis of a relatively small random subset of these attributes can capture information on the unobserved attributes as well. Moreover, we show that under mild technical conditions, clustering the objects on the basis of such a random subset performs almost as well as clustering with the full attribute set. We...
متن کاملImproving envelopment in data envelopment analysis by means of unobserved DMUs: an application of banking industry
In data envelopment analysis, the relative efficiency of a decision making unit (DMU) is defined as the ratio of the sum of its weighted outputs to the sum of its weighted inputs allowing the DMUs to freely allocate weights to their inputs/outputs. However, this measure may not reflect a the true efficiency of a DMU because some of its inputs/outputs may not contribute reasonably in computing t...
متن کاملBayesian Identification, Extraction and Forecasting of Unobserved Components for Time Series in the Frequency Domains
This work aims to present a full Bayesian framework to identify, extract and forecast unobserved components in time series. The major novelty is to present a probabilistic framework to analyze the identification conditions. More precisely, informative prior distributions are assigned to the spectral densities of the unobserved components. This entails a interesting feature: the possibility to a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008